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As  psychologists  bscosM  nore  Involvsd  in  bionsdiesl  rssssrch  issuss,  thpy  srs  increasingly  eonfrontsd  by  sn  amy  of 
•cthods  and  terns  Hhieh  srs  relatively  naw  to  than,  methods  and  terms  idiieh  11s  within  the  province  of  such  disciplines  as 
epldanlology  and  blostatistlcs.  The  SKposure  to  unfiunlllar  terma  and  methods  can  rssult  in  uncertainty  and  limit  the  effse- 
tivanass  of  psychologists  in  blonadical  research. 


tihi^ive 
■^The  . 


'^The  objective  of  this  study  was  to  examine  the  methods  comaonly  employed  in  epidemiologic  and  bloasdlcal  research  and 
to  relate  than  to  the  methods  traditionally  used  in  the  field  of  psychology.  Ths  intent  of  this  report  is  to  describe  saow 
of  these  methods  and  mplain  them  in  a  clear  and  concise  fashion. 

Approach  y' 

The  study  was  conducted  in  two  discrete  phases.  In  this  first  phase,  research  reports  and  Journal  artielas  ware  aurveysd 
to  ascertain  the  range  of  methods  used  by  epldanlologlats  and  medical  rasaarehars.  These  methods  were  placed  into  two  cate¬ 
gories:  The  first  category  included  those  statistical  methods  and  concepts  coamonly  used  by  psychologists;  the  sacond  cate- 
gory  Included  methods  and  concepts  which' appeared  to  be  unique  to  epidemiology  and  blostatlstica. 

In  the  second  phase,  several  epidaalologlc  and  blostatiatics  teicts  were  consulted  and  used  in  an  attempt  to  acplaln  tha 
principles  behind  the  methods concepts  placed  in  the  second  category.  Examples  which  displayed  the  potential  appllea- 
tlons  of  these  methods  claP  were  selected. 
hsaults 

The  survey  raviaalad  many  similarities  in  statistical  methods  which  are  quits  fhadllar  to  psychologists.  Hypothasas  arc 
formulated  and  tested  in  laieh  the  same  manner  and  chi-square,  regression,  eorradatlon,  and  snalysaa  of  wrlanee  are  commonly 
employed  In  studies  of  morbidity  and  mortality. 

It  also  was  found  that  apldeadologle  studies  aaploy  rates  and  measures  which,  although  saldom  sean  In  psychology,  are 
based  en  statistical  concepts  and  principles  underlying  the  methods  developed  by  psychologists.  Rates  such  as  tbs  standard¬ 
ised  mortality  ratio  sod  Ineldanea  and  prevalence  retaa  are  measures  of  probability.  Nsasures  of  association  such  as  ths 
relative  risk  and  phi  coefficient  are  grounded  In  the  comparison  between  observed  and  eRpactad  fraquanciaa  cn  lAleh  tha  dd- 
^qnare  teat  aaployad  by  psychologists  Is  based. 


It  la  concluded  that,  dasplta  ths  dlffarancaa  In  tsxadnology  and  frequent  use  of  retaa  whldi  axe  act  food  la  psychol¬ 
ogy,  the  gap  between  blostatlstles  and  psyeholaglcal  ststiatlcs  Is'  neither  large  ner  cempleK.  Beeanee  the  methods  of  opi- 
deatlology,  bloetatlstles,  and  poynhdlagjr  ere  baaed  on  Boaniw  etattettoSl  pclnelplae,  relatively  fhm  shifte  la  etetlotlcal 
4ttafciag  are  xnqntxed,  other  than  an  maderetaadtag  of  tha  terelnolefy  emplayod,  flnr  peydialagiBts  ts  aetata  a  haste  wenpre 
laMlaa  of  aptdeatelegle  ftadlege.  * 
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An  Epidemiology  Trimer:  Bridging  the  Cap 
between  Epidemiology  and  Psychology 


Psychologists  in  recent  years  have  begun  to  take  an  active  role  in  health-related  research:  to  pose  research  questions, 
raise  methodological  issues,  and  furnish  answers  to  biomedical  problems.  In  this  new  role  they  are  likely  to  experience 
some  degree  of  confusion  and  frustwition.  familiar  ground  may  be  obscured  by  new  labels  or  subtle  changes  in  landscape. 
Areas  formerly  traversed  with  confidence  may  now  cause  uncertainty  or  trepidation.  It  may  be  felt  that  bi  ostat  i  si  ics  repre¬ 
sents  an  entirely  new  set  of  concepts  and  methods  that  requires  extensive  '*re-tool  ing”  for  understanding  and  application. 

How  readily  can  psychologists  familiarize  themselves  with  this  *’new  ground"?  What  shifts  in  statistical  thinking  are  neces¬ 
sary  to  attain  a  basic  ctxnprehension  of  epi demiol ogic  findings?  The  objective  of  this  paper  is  to  answer  these  questions  by 
examining  statistical  techniques  cocnmonly  used  in  epidemiologic  studies  that  are  generally  unfamiliar  to  psychologists  and, 
thus,  to  some  extent  bridge  the  communicat ion  gap  between  these  disciplines. 

To  begin,  it  must  he  recognized  that  all  applications,  whether  in  the  biological  and  health  sciences  or  psychology, 
derive  from  the  same  general  theory  of  statistics.  Therefore,  biostatisticians  and  psychologists  are  taught  the  same  basic 
statistical  concepts  and  methods.  The  existence  of  a  common  background  can  be  shown  by  a  comparison  of  textbooks  in  these 
fields  which  reveals  very  similar  tables  of  contents  including  chapters  on  descriptive  statistics,  probability,  the  binomial 
distribution,  the  normal  distribution,  estimation,  hypothesis  testing,  chi-square  tests,  correlation  and  linear  regression, 
analysis  of  variance,  and  nonpammet rio  methods.  Epidemiologists  and  biostatisticians  also  are  taught  the  special  tools  of 
demography  and  vital  statistics  (morbidity  and  mortality  rates  and  ratios)  while  psychologists  typically  learn  factor  analj^ 
sis  and  psychological  scaling  techniques,  for  the  psychologist,  the  area  of  vital  statistics  is  undoubtedly  very  puzzling 
because  of  the  unfamiliar  terminology  and  the  unique  epidemiologic  perspective.  Therefore,  our  discussion  will  begin  with  a 
brief  description  of  the  typical  appnaches  taken  in  epidemiologic  studies,  specifically  the  nature  of  cross-sectional, 
retrospect i ve,  and  prospective  studies  and  the  case-control  and  cohort  methods.  From  this  perspective,  we  will  consider 
rates  and  ratios  ct»mmonly  employed,  methods  used  to  adjust  rates  so  that  diverse  populations  can  be  compared,  and  measures 
of  asstxjiation  among  variables  that  affect  morbidity  and  mortality. 

EPIPEMIt.T.QGlC  APPROACHES  TO  THE  STUDY  OF  ILUIESS 

MacMahcm,  Pugh,  and  Ipsen  (I960)  divide  epidemiologic  investigations  into  four  separate  categories: 

(1)  Descriptive  Epidemiology.  Descriptive  epldonlology  is  concerned  with  distributions  of  disease  and  comparisons  of 
different  populations  or  different  segments  of  the  same  population  on  mozhidity  and  mortality  indices. 

(2)  Formlation  of  Hypotheses.  Tentative  explanations  of  observed  disease  distributions  are  attaapted  in  terms  of 
possible  causal  associations  of  a  direct  nature. 

(3)  Analytic  Cpidealology.  This  branch  of  epidemiology  consists  of  observational  studies  designed  specifically  to 
•(•■line  and  test  hypotheses  developed  from  descriptive  studies. 

(4)  toperimental  Epidemiology.  Dcperlmental  studies  are  conducted  on  hunan  populetiona  to  confini  in  a  rigorous  eenner 
hypotheses  that  stand  the  last  of  observational  analytic  studies. 

It  seams  apparent  that  catagoriaa  (2)  and  (3)  refer  to  aathoda  of  statiatlcal  inference  and  could  readily  be  ccmbinad. 
We  trill  treat  both  of  these  categories  under  the  hasdlog  of  analytic  apldmaiology  where  the  apldamlologlst  or  bioatatiatl- 
eian  attmipta  to  derive  inductions,  gananilisatloaa,  or  conelualona  about  quaatlona  he  has  poaad  by  following  accaptad  rules 
of  avldanca.  (The  peychologlat,  of  course,  followa  the  ••■•  body  of  rulaa  for  arriving  at  conclusiona  also  «BCtaDdlng  bgyond 
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the  immediate  data.)  Ibcperimental  studies  in  epidemioliigy,  that  fs,  random  assignment  of  individuals  to  tscpcjsure  situations 
or  treatments,  are  rarely  possible  because  of  practical  and  ethical  constraints  and  will  not  be  considered  further  here. 

This  {«per  will  be  concerned  only  with  descriptive  and  analytic  epidemiology. 

Di:SCRl  iTTVi:  bl’lhlXTOUlGY 

Descriptive  statistics  in  epidemiology  involves  the  use  of  standardized  indices  that  reflect  typical  or  usual  values, 
the  amount  of  variability  in  sets  of  observations,  and  relationships  am<ing  variables  of  interest.  Thus,  descriptive  epide¬ 
miology  provides  methods  for  organizing,  summarizing,  and  communicating  study  results.  Descriptive  epidemiology  is  rj<it  ct»n- 
cemed  with  the  causal  implications  «>r  ccmclusi(»ns  that  might  he  drawn  fr<jin  sets  of  data;  sjich  inferences  are  the  province  of 
analytic  epidemiology. 

The  most  commonly  used  descriptive  variables  in  epidemiology  pertain  to  time,  person,  and  place.  According  to  MacMahon 
et  al.  (1960),  ’’Statements  of  the  frequency  of  a  given  trait  or  disease  manifestation  in  various  populations  are  essential 
to  descriptive  epidemiology.  Such  statements  permit  comparisons  between  populations  and  between  subgroups  of  a  population 
with  respect  to  the  manifestation  in  question”  (p.  SI).  To  control  for  differences  in  population  or  sample  size,  frequencies 
must  be  expressed  in  the  form  rates. 

The  calculation  of  rates  is  a  simple  procedure  performed  frequently  in  both  epidemiological  and  psychological  research. 

It  is  a  statement  of  probability  expressed  in  a  quantity  or  degree  of  the  phenomenon  measured  per  unit  of  population  per  unit 
of  time.  What  is  known  in  epidemiology  as  a  specific  rate  is  known  in  probability  theory  as  a  conditional  probability. 

In  epidemiology,  this  statement  of  probability  involves  three  different  items  of  information:  (1)  the  number  of  persons 
affected  by  a  particular  illness,  expressed  as  the  numerator,  (2)  the  population  within  which  the  affected  persons  arc 
ibserved,  expressed  as  the  denominator,  and  (3)  a  specification  of  the  time  interval, 

ANALYTIC  EPlDPilOlOGY 

(»n  the  basis  of  a  comparison  of  these  rates  in  different  populations  or  subgroups  of  a  population,  tentative  hypotheses 
are  formulated  which  posit  causal  connections  between  the  observed  distributions  of  disease  and  one  or  more  variables  or 
characteristics  of  the  population.  These  hypotheses  are  then  tested  by  specially  designed  observational  studies. 

Analytic  studies  are  typically  conducted  to  detennine  whether  or  not  an  association  is  present  between  a  certain  charac¬ 
teristic  or  combination  of  characteristics  and  a  disease  in  a  group  of  afflicted  individuals.  In  these  studies,  comparisons 
are  made  between  a  group  of  persons  who  have  the  disease  and  a  group  that  does  not.  The  methods  employed  in  these  studies 
"depend  upon  observing  (hence  the  term  'observational*  studies)  and  quantifying  whatever  is  being  studied"  (Ibrahim  & 

Spitzer,  1979,  p,  139).  An  example  of  a  study  in  psychology  of  this  type  would  be  the  comparison  of  IQs  among  students  of 
different  ethnic  groups  to  determine  if  an  association  exists  between  race  and  intelligence. 

Analytic  studies  usually  fall  within  one  of  two  broad  categories:  case-control  and  cohort.  Both  of  these  are  referred 
to  in  the  research  literature  by  various  names,  thereby  creating  some  confusion  over  the  nature  of  their  differences 
(Feinsteln,  1979;  Ibrahim  A  Spitzer,  1979;  Lilienfeld,  1976;  MacHahon  et  al,,  1960).  In  the  case-control  method,  affected 
(cases)  and  nonaffected  (controls)  groups  are  compared  to  determine  whether  a  particular  characteristic  occurs  with  greater 
frequency  among  those  affected  by  a  certain  disease  or  illness.  There  are  two  major  forms  of  case-control  study,  retro¬ 
spective  and  cross-sectional.  In  the  retrospective  study,  the  objective  is  to  establish  If  the  chaxacteriatic  was  present  in 
the  past.  The  Investigator  looks  backward  In  time  for  exposure.  In  the  croea-sectional  study,  the  characteristic  being  coa»- 
pered  is  present  in  both  cases  and  controls  at  the  time  of  the  investigation.  In  both  types  of  study,  analysis  proceeds  from 
effect  to  cause. 

The  second  category  of  analytic  study  employs  the  cohort  method.  With  this  design,  a  particular  population  is  examined 
to  detetmine  if  the  characteristic  that  nay  be  related  to  the  dieeeee  being  investigated  is  present  in  significant  quantities. 
The  reaeercher  locdce  forwerd  In  time  for  exposure  and  analysis  proceeds  from  cause  to  effect.  In  this  approach,  also  refer- 


4 


red  to  as  prospective  study,  the  population  may  followed  f<»r  several  years  to  certify  which  members  develop  or  die  from 
the  disease. 

There  are  numerous  advant«ines  and  di sadvant ages  with  either  research  approach,  I’rospective  studies  enable  the  researcher 
to  obtain  direct  estimates  of  the  risk  ass»iciated  with  a  suspected  causal  factor  and  to  re<luce  the  probability  of  spurifius 
relationships  resultinji  fmm  bias  in  data  collectini*  procetiures,  but  they  also  (generally  are  laborious,  t  ime-ronsurning,  and 
€Kpensive  {MacMahon,  et  al,,  p.  471.  The  retrospective  study,  on  the  other  hand,  is  relatively  quick  and  inexpensive, 

easily  repeatable,  and  ran  cronomi ral  1  y  exe»mine  a  large  number  of  cases.  However,  it  also  is  subject  to  selection,  infor¬ 
mation,  and  confounding  liiases  C lei nst ein,  I07‘):  Ibrahim  X  Spitzcr,  1979). 

Whatever  their  relative  merits  and  disadvantages,  the  objective  <*f  both  types  of  study  is  to  determine  whether  or  not  a 
relationship  exists  between  a  particular  trait  or  set  of  traits  and  a  specific  illness  or  disease.  The  comparison,  in  its 
simplist,  dichotomous  form,  is  usually  represented  in  a  2x2  table,  as  shown  in  Table  1.  If  a  higher  proportion  of  individuals 
with  the  characteristic  is  f«iund  among  the  cases  than  the  controls,  an  association  between  the  disease  and  the  characteristic 
i  s  i  nd  i  ca  t  od . 


Table  1 

framework  for  the  Study  of  Disease 

Number  of  Individuals 


With  Without 

(J^aracteristic  Disease  Disease 

With  a  b 

Without  c  d 

Total  a+c  b+d 


STATISTICAL  METHODS  FOR  DESCRIPTIVE  EPIDEMIOLOGY 


Total 
a+b  =  Nj 
c+d  ■  N2 
a+b+c+d  "  N 
N 


Calculation  of  Rates 

The  description  of  a  particular  illness  may  utilize  one  or  both  of  two  types  of  rates,  mortality  and  morbidity.  To  conn 
pute  a  m«,rtality  or  death  rate,  the  following  specific  information  is  needed:  (1)  In  the  numerator  is  included  the  number  of 
deaths  in  the  exposed  or  affected  population  during  a  certain  time  period.  (2)  In  the  denominator  is  the  total  population 
group  exposed  to  the  risk  of  death,  (3)  A  time  factor,  usually  a  l-year  Interval,  is  specified.  The  annual  death  rate  can 
be  calculated  with  this  Information. 


Annual  death  rate  (ADR)  Total  nun^er  of  deaths  during  a  specified  period 

from  all  causes  (per  of  a  yeer  x  1,000 

1,000  population)  Number  of^  period  in  the  population  at  mid-year 


Thus,  if  1,200  deaths  occurred  In  a  population  of  1,000,000  in  1980,  the  annual  death  rate  would  he: 


Annual  death  rate  (ADR)  1,200  deaths  in  1980  x  1,000 

In  1980  (per  1,000  =  1,000,000  persona  present  ae  of  July  1980  =  1.2  per  1,000  populatiMi. 

population: 

The  unite  of  time  and  population  may  be  selected  by  the  inveett^tor  to  suit  hie  own  purpoeea,  but  they  must  be  speci¬ 
fied.  Death  rates  also  can  be  made  specific  for  a  variety  of  cha  met  eristics,  such  as  age,  cause  of  death,  marital  status, 
mee,  end  occupation. 


ADR  frooi  all  cauaoe  Nwber  of  deaths  of  persona  18-24  during 

for  pereone  18-24  (per  *  ■  period  of  1  year  X  1,000 

1,000  population)  Nuoi^ar  of  18-M  year  olda  in  the  popula- 

tlon  at  mld-yoar 
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ADR  from  lung  cancer 
(per  1,000  population) 


X  1,000 


Number  of  deaths  from  lung  cancer 
per  year 

Number  of  persons  in  tlie  popula- 
tion  at  mid-year 


Another  type  of  rate  frequently  used  is  the  "case  fatality  rale”: 

Number  oi'  individuals  dying  during  a  specified 
Case  fttality  rate  (  V)  =  period  of  time  after  onset  or  diagnosis  of  disease  X  loo 

Number  oi*^individuals  t^'th  the  sp^ified  disease 


This  rate  represents  the  risk  *»f  dying  during  a  definite  period  of  time  for  those  individuals  who  liave  the  particular  dis¬ 
ease.  As  with  the  death  rate,  the  period  of  time  during  which  the  deaths  occurred  should  be  indicated.  Case  fatal  ity  rates 
also  can  be  made  specific  for  age,  sex,  severity  of  disease,  and  any  other  factors  of  clinical  and  epidemiological  im[)orlance. 
The  third  mortality  rate  used  in  epidemiologic  research  is  the  proportion  of  total  deaths  due  to  a  specific  cause; 

Proportionate  mortality  rate  Number  of  deaths  from  cardiovascular  diseases 

fr«*m  cardiovascular  diseases  in  the  U.S.  Navy  in  1070  X  loo 

in  the  I'.S,  Navy  in  1970  Total  deaths  in  tlie  U.S.  Navy  in  1970 

However,  since  this  rate  depends  on  l-w<t  variables,  it  is  of  limited  value  in  making  comparisons  between  different  populations 
or  time  periods.  It  also  fails  to  directly  measure  the  risk  or  probability  of  a  person  in  a  population  dying  from  a  specii’ic 
disease  as  does  a  cause  specific  mortality  rate. 

One  of  the  most  frequently  used  morbidity  rates  is  the  incidence  rate  which  is  defined  as  the  number  of  new  cases  of  a 
disease  that  occur  during  a  specified  period  within  a  specified  unit  of  population. 

Number  of  new  cases  of  a  disease  occurring 

Incidence  rate  per  l,00n  =  in  a  population  during  a  specified  period  of  time  X  1,000 

iTimiber  of  persons  exposed  risk  of  developing  the 
disease  during  that  period  of  time 

Another  morbidity  rate  is  the  prevalence  rate,  which  measures  the  number  of  cases  that  are  present  at,  or  during,  a  specified 
period  of  time.  The  prevalence  rate  equals  the  incidence  rates  times  the  average  duration  of  the  disease,  for  ©cample,  if 
the  average  duration  of  hypertension  is  three  years  and  its  incidence  rate  is  15  per  1,000,  the  prevalence  rate  would  be  45 
per  1,000. 

Number  of  cases  of  disease  present  in 

I'revalence  rate  per  1,000  =  the  population  at  a  specified  time  X  1,000 

Niffliber  of  persons  in  the  population 
at  that  specified  time 
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The  two  types  of  prevalence  rates  which  are  used  by  investigators  are  point  prevalence  and  period  prevalence.  Point 
prevalence  refers  to  the  number  of  cases  present  at  a  specified  moment  in  time;  period  prevalence  refers  to  the  number  of 
cases  that  occur  during  a  specified  period  of  time,  for  example,  a  year.  Period  prevalence  consists  of  the  point  prevalence 
at  the  beginning  of  a  specified  period  of  time  plus  all  new  cases  that  occur  during  that  period. 

All  forms  of  morbidity  rates,  including  incidence  and  prevalence  rates  can  be  made  specific  for  age,  sex,  and/or  any 
other  personal  chaificteristics.  They  also  can  be  standardized  in  the  same  manner  as  mortality  rates. 

AGE  ADJUSTMEirr  TOU  MORTALITY  RATES 

The  population  characteristic  that  has  the  greatest  influence  on  nortallty  rate  is  the  age  of  the  members.  Since  dif^ 
femees  in  the  age  compoaitlon  of  a  population  will  influence  the  total  mortality  rates,  it  Is  preferable  to  use  age  spe* 
cific  mortality  rates  In  coaniering  the  mortality  experiences  In  two  different  geographical  areas,  population  groups,  or  time 
periods.  To  control  for  differences  In  the  age  distribution  of  a  population,  two  different  susaary  statistics  may  be 
sBployed:  the  direct  method  of  age  adjustment  and  the  standardised  mortality  ratio.  Both  rely  upon  a  conpariaon  of  SKpected 
ntee  of  a  standard  or  control  group  with  the  observed  rates  of  the  population  under  study. 
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Direct  Method  of  Age  Ad.lustment.  The  basic  procedure  for  this  method  is  to  apply  the  age-specific  mortality  wtes  for 
the  two  groups  that  are  being  compared  to  the  number  in  the  same  ag2  groups  of  the  standard  population.  For  most  studies  con¬ 
ducted  in  the  United  States,  the  standard  population  is  the  population  of  the  U.S,  as  determined  in  the  1040  census.  This 


procedure  gives  the  number  of  deaths  that  can  be  expected  in  the  standard  population  if  these  age-specific  rates  from  the 
observed  populations  had  prevailed  in  the  standard  population.  An  example  of  the  use  of  this  method  to  adjust  the  calcula¬ 
tion  of  a  mortality  rate  to  control  for  age  is  found  in  Table  2. 


Table  2 

Calculation  of  the  Age-Adjusted  Mortality  Rates  from  All  Causes 
by  the  Direct  Method:  United  States,  1^-50  and  1960^ 


Expected  Number 


Mortality  from  All  Standard  Population:  of  Deaths  that  Would  Occur 

Causes  per  100,000  Total  U.S.  Enumerated  in  Standard  Population  at 


Age 

Populat 

ion _ 

Population  per  1,000,000 

Rates  in 

Group 

( Yea  rs ) 

1950 

1960 

1940 

1950 

1960 

'  ■ 

■ 

(1) 

(21 

(3) 

(l)x(3') 

('2)x(3) 

<  1 

3,299.2 

2,696.4 

15,343 

506.2 

413.7 

1-4 

139.4 

109.1 

64,718 

90.2 

70.6 

5-14 

60.1 

46.6 

170,355 

102.4 

79.4 

15-24 

128.1 

106.3 

181,677 

232.7 

193.1 

25-34 

178.7 

146.4 

162,066 

289.6 

237.6 

35-44 

358.7 

299.4 

139,237 

499.4 

416.9 

45-54 

853.9 

756.0 

117,811 

1,006.0 

890.7 

55-64 

1,901.0 

1,735.1 

80,294 

1,526.4 

1,393.2 

65-74 

4,104.3 

3,822.1 

48,426 

1,987.5 

1,850.9 

75-84 

9,331,1 

8,745.2 

17,303 

1,614.6 

1,513.2 

854 

20,196,9 

19,857.5 

2,770 

559.5 

550.4 

Total  death  rate 

all  ages 

963.8 

954,7 

— 

— 

““ 

Total  population 

- 

1,000,000 

- 

- 

Total  expected 

number  of  deaths 

— 

— 

— • 

8,414.5 

7,609.7 

Age-adjusted  death 

rate  per  100,000 

-* • 

— 

841.45 

760.97 

^Source:  Klebba,  Mauer,  and  Glass 

In  Table  2,  the  age-adjusted  rate 

(1973) 

in  1960  is 

much  lower  than  in  1950,  in  contrast 

to  the  total  death 

rates  where  the 

1960  death  rate  is  only  slightly  lower.  The  difference  between  the  changes  in  the  total  and  age-adjusted  death  rates  results 
from  the  fact  that  the  1960  population  has  a  larger  proportion  of  people  in  the  older  age  groups  than  the  1950  population. 

The  total  death  rate  is  affected  by  both  the  age-specific  death  rates  and  the  age  distribution  of  the  population.  The  age 
adjustment  procedure  is  used  to  remove  the  influence  of  the  age  distribution  of  the  population  by  use  of  a  standard  popula¬ 
tion. 


Standardized  Mortality  Ratio  (SMF).  A  second  method  of  age  adjustment  Is  a  statistic  widely  used  in  studies  of  occupa- 
tiofwl  mortality.  It  is  defined  as  the  mssber  of  deaths,  either  total  or  cause-specific,  in  a  given  occupational  group 
SKpressed  as  a  percentage  of  the  nuaber  of  deaths  that  would  have  been  wcpsctBd  in  that  occupational  group  ^  the  age-  and 
B«c-specific  ratee  In  the  general  population  were  applicable.  The  etatiatlc  is  calculated  by  using  the  formula: 


Standard  Mortality  Ratio  (SNR)  =  Observed  nmiber  of  deaths  per  year  X  100  =  % 

^icpected  matMT  deat'lia  per  ymr 

The  acpeeted  nuad>er  of  deathe  per  yeer  of  a  particolar  eaaple  is  eelculated  by  using  the  equation  N  =  ^b  where 
a  -  the  mvber  of  aaaple  aarfiara  belonging  to  e  particular  age  group 
b  =  the  etandard  d«th  rate  In  the  general  population  for  that  eaae  age  group. 
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An  example  of  this  prcKredure  is  found  in  l-ilienfeld  (1*^76).  hetween  1949  and  19S3,  there  were  7,320  deaths  atnonp  male 
famers  and  f'ann  fTWinajjers  in  i.ngland,  «jr  an  average  of  1,464  deaths  per  year.  In  determining  whether  this  figure  indicates 
a  tmrTThil,  high,  or  l<iw  mortality  risk  for  this  grt)Up,  a  standardized  mortality  ratio  is  calculated  using  the  slandartj  death 


rates  i?!  Ingland  (see  Table 


Tal>lG  3 

taloulation  of  the  Standardized  Mortality  Ratio  for 
()ccn|)a1  ion  oi'  Male  farmers  and  farm  Managers  for  All  iauses  oi'  heath: 

l‘»S|a 


Ago 


•J5~34 
3  5-44 
45-54 
55-04 


Number  of  fanners 
an<l  farm  Managers 
if'ensus,  19511 

_ in _ 

7,989 

37,030 

00,838 

68,687 

,SS,565 


Total  expected  deaths  per  year:  2,083 
Total  observed  deaths  per  y^r:  1,464 


SMR 


Standard  DeiJth  Kates 
per  1,000, *100  fAll 
Causes  of  Death) 

_ L2_L_ 

1,383 
1 ,594 
2,868 
8,212 
22,953 


1,464 

2,083  X  100  =  70.3?' 


^Source:  Registrar  General’s  Decennial  Supplement  (1958). 


fxpe<’le<l  Numljer  of 
Deiiths  for  fanners 
and  farm  Managers 
per  1  ,ooo,ono 

i3)  (])  X  (2) 

U 

59 

174 

564 

1,275 


Table  3  gives  the  results  of  such  a  calculation.  The  SMR  indicates  that  the  mortality  experience  of  farmers  and  farm 
managers  was  only  70. 3Z  of  the  total  population  rate  from  all  causes  of  death. 

Although  the  SMR  is  a  widely  used  statistic  in  epidemiology,  it  also  possesses  certain  limitations.  Wong  (1977),  fur 
instance,  notes  that  the  comparison  of  SMRs  is  questionable.  Gaffey  (1976)  cites  three  specific  limitations  to  the  SMR: 

(1)  the  lack  of  a  relationship  between  the  SMR  and  the  life  expectancy  of  a  particular  population;  (2)  the  unequal  sizes  of 
the  SMR  and  the  relative  risk,  the  discrepancy  depending  on  the  age  of  the  study  population,  and  (3)  at  older  ages,  the  SMR 
is  subject  to  limitations  on  its  possible  values,  more  or  less  independently  of  any  hazard  to  which  the  study  population  may 
be  exposed.  Based  upon  these  limitations,  Gaffey  recommends  jgaint  the  use  of  the  SMR  as  an  estimate  of  relative  risk, 
believing  that  the  SMR  in  general  will  be  a  biased  estimate  of  that  relative  risk  and  its  bias  will  be  different  with  oach 
age  group.  However,  Symons  and  Taulbee  (1981)  state  that  the  SMR  can  be  a  useful  approximation  of  relative  risk  when  (1)  the 
age-specific  rates  in  the  comparison  population  for  the  cau8e(s)  of  interest  are  no  larger  than  about  100  per  10,000  subjects 
per  year,  (2)  the  age  bands  are  not  too  broad,  and  (3)  the  age-specific  mortality  rates  for  the  study  and  comparison  popula¬ 
tions  are  in  approximately  constant  ratio  across  the  age  bands. 

MEASURES  OF  ASSOCIATION 

In  both  retrospective  and  prospective  studies,  the  object  of  research  is  to  determine  whether  or  not  a  correlation  can 
be  established  between  a  specific  characteristic  and  the  disease  being  examined.  Tsychologists  have  Icmg  employed  statisti¬ 
cal  methods,  such  as  analyses  of  variance  and  regression  analyses,  to  measure  the  strength  or  degree  of  association  between 
the  variables.  In  sons  instances,  epldoilologlsts  employ  similar  methods;  other  instances  require  the  use  of  methods  not 
found  in  psychology.  In  this  paper,  four  specific  methods  conmonly  used  in  biostatistics  which  examine  the  relationship  or 
association  between  two  or  more  variables  are  reviewed:  Poisson  distributions,  relative  risk,  measures  based  on  chi-squares, 
and  attributable  risk. 

Poiaaon  Dlatributiona.  Hypothaala  testing  Involves  the  use  of  a  sampling  distribution  in  which  probabilities  (ocpected 
frequancias)  are  compared  with  outcome  (observed  frequenciee).  A  dlatributlon  of  probabilities  indicates  the  likelihood  of 


I 
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each  of  the  ohperved  frequencies  if  the  assuinptions  made  rejiarding  the  phen«nenon  under  study  are  actually  correct.  There 
are  three  pmbnhilily  dist  rihu1  ions  commonly  employed  in  sintislical  analyses:  the  normal  dist  rihui  ion,  binomial  dislribu- 
tion,  and  I'oisson  d  i  si  ribul  i  on.  Psycliolo^ist  s  are  faniiliarwilh  the  first  Ivo:  t  lie  third,  however,  is  more  cfimnujnly  I'otind 
in  biomedical  research.  Although  the  1‘oisson  distribution  is  often  used  as  an  approximation  of  the  Ijinomial  d  i  st  ri  l)ul  i  ons , 
it  may  provide  a  useful  distril>ut  ion  in  its  *H'm  right. 

The  i'oisson  distribution  is  used  when  events  or  entities  are  random  and  independent  of  each  other  and  when  the  proljabil- 
ity  of  an  event  is  ver.\’  srwall  am),  even  if  the  sample  sive  is  large,  only  a  small  number  of  events  are  observed.  These  dis¬ 
tributions  are  likely  1(‘  be  obtaineil  wlien  the  f)bsGrva1  ions  consist  of  counts  such  as  the  number  of  cases  of  a  [Virticular  ill¬ 
ness  over  a  fixed  perio*!  of  time. 

The  I'oisson  distribution  ntnsists  of  a  distribution  of  random  variables  taking  values  of  0,  1,  2,....  If  the  variable 
X  has  a  jwrticular  value,  then  the  probability  from  the  I'oisson  distribution  that  this  value  will  occur  is 

e~H  pX 
xT 

The  quantity  *’e”  in  this  formula  is  a  constant  with  a  value  approximately  equal  to  2,71828.  For  variables  that  have  a 
I'oisson  distribution,  the  mean  and  the  variance  ore  equal,  that  is  to  say,  p  "  <7.  In  contrast  to  the  binomial  distribution 
v.hich  requires  a  knowledge  of  n  and  p,  the  I'oisson  distribution  requires  only  a  knowledge  of  the  distribution  mean,  or  p, 
which  can  take  any  value  greater  than  zero. 

Kemington  and  Schork  ('I97ni  present  two  general  models  which  demonstrate  the  utility  of  the  I’oisson  distribution.  The 
first  is  characterized  by  a  large  quantity  of  some  medium  such  as  sea  water  or  air  in  which  are  found  a  large  number  of  small, 
discrete  entities,  such  as  plankton  or  bacteria.  One  of  the  most  important  traits  of  this  model  is  that  there  is  a  uniform 
density  of  the  small  entities  throughout  the  medium.  When  a  small  quantity  of  this  medium  is  examined,  the  probability  that 
this  sample  will  contain  x  number  of  entities  is  the  I’oisson  probability. 

The  second  model  producing  I'oisson  probabilities  concerns  events  occurring  in  time.  Such  an  event  would  include  members 
of  a  community  who  contract  a  particular  illness  or  disease.  If  the  events  occur  independently,  the  probability  that  an 
event  will  occur  in  a  short-time  interval  is  proportional  to  the  length  of  the  interval,  and  the  time  interval  is  short 
enough  such  that  the  probability  of  more  than  one  event  occurring  in  such  a  time  interval  is  negligible,  then  the  probability 
that  X  events  occur  in  a  fixed  time  interval  is  the  I'oisson  probability. 

Relative  Risk.  The  most  common  measure  of  association  in  retrospective  studies  is  relative  risk  which  reflects  the 
incidence  of  disease  among  a  group  possessing  a  certain  characteristic  relative  to  a  group  without  the  characteristic.  The 
measure  Indicates  the  likelihood  a  member  of  a  specified  population  will  acquire  and/or  succumb  to  a  disease  if  he  possesses 
the  characteristic  under  study.  Thus,  a  study  which  determines  that  the  relative  risk  of  lung  cancer  for  cigarette  smokers 
is  3,3  is  stating  that  the  risk  of  contracting  lung  cancer  is  3,3  times  greater  for  smokers  than  for  nonsmokers. 

Relative  risk  is  calculated  from  a  2x2  table  in  which  the  nwr^er  of  cases  and  controls  are  compared  with  respect  to  the 
presence  or  absence  of  a  particular  characteristic  (see  Table  1),  The  cross  products  are  then  multiplied  and  divided,  pro¬ 
ducing  the  following  equation: 

ad 

RR  =  ^ 

This  equstlon  gives  an  approximation  of  relative  risk  and  assiiws  that  (1)  the  cases  and  controls  have  been  selected  at  raiv- 

dom  and  are  representative  of  the  larger  population,  and  (2)  the  frequency  of  the  disease  in  a  population  is  relatively  small. 

ad 

If  RR  is  equal  to  I  or  unity,  then  Be  as  an  apprcKimation  of  relative  risk  is  exact.  This  equation  is  the  one  used  most 
often  in  calculating  relative  risk. 

If  the  frequency  of  disease  in  a  population  is  large  or  the  approKlmation  of  RR  proves  to  be  Inadequate,  i.e. ,  In  cases 
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where  tliere  are  multiple  catejiories  t»l‘  ^iroU[)s— <li  I'rerent  siihuroups  I>y  ajje  or  oern|vition — iin<ler  study,  1  lujre  is  a  more  accu¬ 
rate  iiHVisure  developed  l>y  Mantel  and  ilaeiiszel  Tl'tS'}).  The  revise*!  relative  risk  is  calculated  as  ('tdUjws: 


'<'Vh 


N 


In  addition.  Mantel  and  Haensyo)  have  calculatotl  summary  relative  risk  f»qiin1  ions  for  sepfirato  subcat  ejjories  of  exposure.  The 
rationale  lUr  those  «*<iuat  i  on.‘-'  i--'  that  "...over-all  relative  risk  estimates  are  averaf*es  and  ns  averaf»os  m/jv  conceal  substan¬ 
tial  variation  in  the  irvi^nitudes  ol'  the  relative  risk  nmonji  subtiroiips.  Ordinarily,  the  individual  subcatG^iorv  data  slionlt!  ho 
exa'vi  nod ,  jviyinp  special  attention  t  (j  relative  risks  base<l  on  r*iasonably  larjie  sample  sizes.  This  will  proviih;  protectifjn 
against  the  potential  deficiencies  of  any  iw'ii*t  icul.ar  summary  relative  risk  formula  employed"  (Mantel  K  Maenszel  ,  o. 

"4(^1.  An  f:*xa:i!p!e  of  such  a  summary  relative  risk  formula  is  tl>G  following: 


ad 

RH  be 

lYaVnid') 

TTTTMTrr 

where; 


= 

NiM, 

N 

N,M2 

N 

Lf  c  1  = 

N^Mi 

N 

rfdi  = 

N2M2 

N 

A  simpler  method  of  calculating  relative  risk  for  multiple  categories,  however,  is  to  prepare  a  series  of  2x2  tables 
comparing  controls  and  cases  at  different  levels  of  ©eposure  and  then  to  compute  the  relative  risk  for  each  table.  An 
example  of  such  a  procedure  is  found  in  r.ilienfeld  (1976)  (see  Table  4). 

Age  ad.justment  procedures  are  also  important  when  calculating  relative  risk.  One  such  procedure  is  the  matching  case 
method  in  which  a  sample  of  N  diseased  individuals  is  drawn  and  the  characteristics  of  each  individual  noted  with  respect  to 
the  control  factors.  Subsequent  ly,  a  sample  of  N  well  individuals  is  drawn,  with  each  individual  matched  on  the  control 
fact<irs  to  one  of  the  diseased  individuals.  In  applying  such  a  procedure,  the  2x2  table  takes  on  a  different  form  from  that 
shown  in  Table  4,  The  cell  in  Table  S  in  the  upper  left-hand  comer  contains  r  number  of  pairs  in  which  both  cases  and  con¬ 
trols  possess  the  characteristic  of  interest.  The  marginal  totals  (a,b,c,d)  represent  the  entries  in  the  cells  of  Table  L 
and  the  total  for  the  entire  table  is  ‘^N  pairs  where  N  represents  the  total  number  of  paired  individuala  The  calculation  of 
the  relative  risk  for  this  table  would  be: 

^  Y  (provided  t  ^  0) 

Table  4 

Relative  Risk  for  Smokers  and  Nonsmokers 


Example  of  Calculating  Relative  Risk  for  Multiple  Categories 


Daily  Average 
igarettes  Smoked 

bung  Cancer 

I’atients 

Ccoitrols 

Relative  Risk  of  Different 
Categories  of  Smokers 
to  Nonsmokers 

0 

7 

61 

1.0 

1-4 

55 

129 

3.7 

5-14 

489 

570 

7.5 

15-24 

475 

431 

9.6 

25-39 

293 

154 

16.6 

50+ 

38 

12 

27.6 
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The  different  cletirees  <»r  levels  of  cijjarette  smoking  are  t<i  !)»»  etmiwrcMl  with  the  niTismokers,  and,  1  her»?f<ii*e,  t  fie  relative 
risks  of  cancer  for  nonsmokers  is  taken  to  he  l.n.  The  risks  for  smokers  comjvi  red  to  ruiiismoker‘^  are; 


K'K  ( f  —1  c  i  j'a  r'Ot  f  es  Wa  i  i  v  ^ 

KK  i>-l4  c i jiarol  1  es  {lailyf 
UK  MK-’J4  cij'areites  daily! 


X  4:m 


Tahle 

'^lo(iel  of  (^^l^ula1ion  of  Kelalivt?  Risk  for  Matched  ‘Vi«;es 
an<i  Controls  With  and  Uitlmui  a  «  Ivi  ract  eri  st  ic 


Cases 

With  cliaractcrist  ic 
Without  charact  erist  ic 
Total 


c.ont  rol  c 

With  Wit  houl 

Cha ract eri st  ic  (  ha root eri si  ic  Total 

r 
1 

h* 


*a,h,o,  and  d  are  the  entries  in  the  cells  in  Table  1. 


A  test  of  whether  or  not  the  observed  difference  between  ad  and  be  is  due  to  sampling  variation  is  provided  by  a  chi- 
square  test  for  2x2  tables.  Mant el 'Haenszel  have  developed  a  cM-square  formula  specifically  for  use  in  testing  the  signifi¬ 
cance  of  a  relative  risk  correlation. 


IVfa) 

where 

r.(al 

.  NiM, 

N 

and 

Wa) 

N^(N-n 


If  ^  3.R4,  one  may  conclude  that  it  is  unlikely  that  the  difference  in  risk  between  the  group  with  and  the  group  without 
the  characteristic  is  a  result  of  chance. 

When  testing  for  significance  in  a  matched  fairs  example  such  as  in  Table  5,  the  McNemar  test  where: 


=  flt-gl-1)^  with  1  df 
t+S 

may  be  employed. 

In  establishing  confidence  limits  for  the  test  of  significance*  confidence  limits  of  the  logarithm  fto  the  base  el  of  a 
corrected  relative  risk  are  computed  and  the  logarithmic  confidence  levels  are  then  reconverted  to  the  original  scale.  The 
addition  of  0.5  to  the  numbers  a,b,c,  and  d  corrects  for  a  bias  which  can  occur  with  small  numbers  of  observations.  Using  the 
log-relative  risk  rather  than  the  relative  risk  itself  simplifies  calculations  of  standard  errors  necessary  for  computing  con¬ 
fidence  intervals. 

Using  the  Chi-Square  Test.  The  chi-square,  a  statistical  tool  familiar  to  most  psychologists,  has  two  basic  uses  in 
epidemiologic  research.  First,  the  chi-square  test  may  be  used  to  evaluate  whether  or  not  frequencies  which  have  been 
empirically  obtained  differ  significantly  from  those  which  would  be  expected  under  a  certain  set  of  theoretical  assumptions, 
that  is,  testing  the  null  hypothesis.  Second,  the  chi-square  test  nay  be  t  'ed  in  determining  the  degree  or  strength  of  an 
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assix'iation.  As  with  the  standardized  mortality  ratio,  the  c|'i-s/juare  is  based  upon  a  comparison  ot'  fAjserve<i  anti  ac(iecte<i 


freqv:eiicies.  ('hi-sqiiare  is  obtained  by  taking  the  square  of  the  difference  between  the  observed  anc  expected  fre^juenc f es  in 
each  cell  divided  by  the  expected  number  of  cases  in  each  cell: 


X“  =  when  all  observed  and  expected  frequencies  are  identical.  If  '*♦  thrnj  i  lie  null  liyput  In;s  is  is  confi  rnptl.  Tlu* 

greater  the  difference  between  observed  and  expected  frequencies,  the  greater  the  chance  the  null  hypotliesis  is  rejected. 
The  chi-square  for  a  "x2  table  may  be  calculated  using  the  formula 


( Had-bcj  ~N/1.*  >^.V 
-  r.  "  ^a+h)  fb-^cV  Ta  ‘c  •  Cd+lo 


In  the  case  of  n  2x2  table  where  any  c)f  the  CDcpected  frequencies  are  5  <ir  less,  a  correction  tV>r  continuity,  known  as  the 
Yates  Correction,  can  be  made  by  adding  or  ubiracting  fJ.S  from  the  observed  frequencies  in  order  to  reduce  the  m<>gniliMle  of 
the  chi-square.  The  usefulness  of  this  modification,  liowever,  has  been  subject  to  debate  Cef,  ,  fleiss,  Mantel  and 

Greenhouse,  l^OS;  Grizzle,  1^67;  Remington  .v  Schork,  197^0, 

A  common  fallacy  in  employing  the  chi-square  test  is  to  use  the  value  of  chi-square  itself  as  a  measure  of  the  degree  to 

which  a  disease  and  a  characteristic  are  associated  with  one  another,  f.ven  though  chi-square  is  excellent  as  a  measure  of 

the  significance  of  an  association,  it  does  not  indicate  the  degree  of  association  because  it  is  a  function  both  of  the  proj*- 

erties  of  the  various  cells  and  the  total  number  of  subjects  studied.  The  degree  of  association  present  is  really  only  a 

function  of  the  cell  proportion,  which  explains  why  relative  risk  is  used  as  a  measure  of  association. 

There  are,  however,  measures  based  upon  the  chi-square  test  which  do  provide  a  measure  of  the  degree  of  the  association 
between  an  illness  and  a  specific  characteristic,  one  such  measure  is  the  phi  coefficient.  The  phi  coefficient  or  0  gives  a 
numerical  value,  ranging  from  0  to  +I  for  a  relationship  between  two  variables  and  is  similar  in  meaning  to  a  correlation  co¬ 
efficient.  It  is  calculated  by  using  the  following  formula: 

(ad-bc)  (ad-bc) 

y7a+CT7a+cT7r<c77t+3T  =  /n^m^  = 

Another  measure  is  V  or  Cramer's  measure  and  is  calculated  by  using  the  formula: 

X^  0 

V"  =  NMin  fi^i,c-l)  Min  ( r^l,c-l) 

where  Min  ('r-l,c-l)  refers  to  either  r-l  or  c-1,  whichever  is  The  smaller. 

Another  measure  is  the  Pearson’s  contingency  coefficient  where: 

C  -  +  N 

Attributable  Risk.  A  fourth  measure  of  the  association  between  a  disease  and  a  particular  characteristic  is  "attribu¬ 
table  risk."  The  measure  was  initially  defined  in  terms  of  lung  cancer  and  smoking  as  the  maximum  proportion  of  lung  <  incer 
attributable  to  cigarette  smoking  (Levin,  1953),  It  is  expressed  as: 

b(r^l) 

AR  ^  b(r-l)+l 

where  r  -  relative  risk  of  lung  cancer  among  cigarette  smokers  as  compared  to  nonsmokers,  and  b  =  proportion  of  the  total 
population  classified  as  cigarette  smokers. 

The  effect  of  relative  risk  (r)  and  the  proportion  of  those  with  a  cha  met  eristic  in  the  population  (b)  on  the  values  of 
the  attributable  risk  are  shown  by  calculations  of  the  attributable  risk  for  different  values  of  r  and  b  in  Table  6.  Thus, 
’’...when  the  frequency  of  a  characteristic,  such  as  cigarette  ssioking  is  low  and  the  relative  risk  for  a  disease  among  clga- 


12 


rettc  smokers  is  nlso  low  only  n  small  proportion  of  the  rases  of*  disease  rtin  !>e  attributed  to  cigarette  smoking”  /" f,i  1  i eni'e) d, 
107e.,  p.  is?^.  The  reverse  is  true  when  relative  risk  and  the  prnporti«»n  of*  smokers  is  biglt.  Alt ri but abl risk,  therefore, 
allitws  one  to  estimate  the  eDctent  ti*  wbirb  a  piirtirular  «lise.ise  is  due  l  a  speriri<*  factor. 

IVible  (. 

At  t  ributalile  Risks  as  a  i'roporfion  f<ir  Selected  VaUies 


t>f  Rel.'it  iv«‘  Risk 

an«l  i‘r<qH»rt  ion  of  I’ 

ipulal  loll 
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l!ie  <;hii  nu'1  erist  ic 

r  Relative 

Risk 

b  -  ion  of  IVqnilation 

with  fharjjct cri Pi  ic  (  percent  ) 

4 
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.67 

86 
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‘HI 

.47 

.73 

80 

.91 
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.74 

90 
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At  t  ril'Utabl  e  risk  is  j-t  iou  la  r  I  >■  useful  l*or  the  study  of  iiiorlalily.  for  the  study  of  fertility  or  recurrent  diseases, 
however,  the  measure  is  limited  because  the  relative  risk  involved  is  the  ratio  of  two  probabilities  M'ark,  19R11.  I’ark  pro¬ 
vides  a  modification  of  the  attributable  risk  measure  that  is  suitable  for  recurrent  events. 

SUMMARY 

rven  a  brief  survey  of  epidemiologic  and  biomedical  research  reveals  the  use  of  statistical  methods  quite  familiar  to 
psychologists.  Hypotheses  are  formulated  and  tested  in  much  the  same  manner  and  chi-squares,  regression,  correlation,  and 
analyses  of  variance  are  commonly  employed  in  the  effort  to  study  the  relationships  between  morbidity,  mortality,  and  numer¬ 
ous  other  environmental,  physiological,  social,  cultural,  and  psychological  variables. 

There  are,  however,  key  statistical  concepts  widely  used  in  epidemiology  and  biostatistics  but  seld<xn  seen  in  psychology. 
Epidemiologic  studies  employ  the  use  of  rates  and  measures  of  association  which  indicate  the  degree  of  a  relationship  between 
a  disease  and  one  or  more  characteristics,  both  the  rates  and  the  measures  of  association  are  based  upon  statistical  con¬ 
cepts  and  principles  underlying  the  methods  utilized  by  psychologists.  Rates,  such  as  the  standardized  mortality  ratio  and 
the  incidence  rate,  are  measures  of  probability  in  which  a  group  of  people  is  compared  with  the  larger  population  over  a 
specified  peri<id  of  time.  Measures  of  association  such  as  the  relative  risk  and  phi  coefficient  are  grounded  in  the  compari¬ 
son  between  observed  and  expected  frequencies  on  which  the  chi-square  test  employed  by  psychologists  is  based. 

Despite  the  differences  in  terminology  and  the  freo"  use  of  rates  in  epidaniologic  and  biomedical  research  which  are 
not  found  in  psychology,  this  brief  review  indicates  t’  t  the  gap  between  biostatistics  and  psychological  statistics  is 
neither  large  nor  complex.  I'sychologists  should  be  able  to  readily  familiarize  themselves  with  this  new  ground  as  the  con¬ 
cepts  which  underlie  the  methods  of  epidemiology  and  biostatistics  are  also  found  in  the  statistical  methods  of  psychology. 
Other  than  an  understanding  of  the  terminology  onployed,  relatively  few  shifts  in  statistical  thinking  are  necessary  to 
attain  a  basic  comprehension  of  epidemiologic  findings. 
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